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ABSTRACT 

Background: worldwide, breast cancer is the most common malignancy among women and there is a deep need 
for precise novel methodologies for breast cancer (BC) diagnosis. Major advances in cancer control will be 
successfully achieved with early cancer detection. So, recent trends are going toward using circulating non-coding 
RNA as diagnostic tool for their critical role in cancer detection. 

Aim: retrieve non coding RNA that is mechanistically linked to breast cancer stem cell with validation of the results 
in a group of breast cancer patients versus control groups to evaluate their usefulness as a potential biomarker in 
breast cancer diagnosis. 

Patients and Methods: we retrieved LncRNA that is linked to stem cell differentiation and specific to BC utilizing 
bioinformatics tools. Then we validated this biomarker in serum of 30 patients with BC, 12 patients with benign 
breast lesion and 12 healthy volunteers using RT-qPCR. We evaluate the power of diagnosis of the serum profiling 
system using ROC curve analysis. 

Results: hoxd antisense growth-associated long non coding RNA (HAGLR) had great sensitivity and specificity for 
differentiating BC from patients with benign breast lesion and also from healthy controls. 

Conclusion: the chosen circulatory RNA based biomarker can be used as a potential diagnostic biomarker for BC. 


In addition it could be therapeutic target. 
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INTRODUCTION 


Breast cancer is the most frequent 
malignancy among women worldwide. It can be 
treated when diagnosed at its earliest stage. Every 3 
minutes a female is diagnosed with BC globally, 


5 Saas 1 
accounting to one million cases every year". 


In Africa, there is a limited data about 
breast cancer incidence. The incidence rates for 
breast cancer between African countries vary 
considerably with the highest rates are in Gharbiah 
governorate (Egypt) [6078 cases reported in Egypt 
during the interval (1999-2007) and the lowest 
rates are in Ghana”. 


When taking a clinical decision on BC 
treatment, Oncologists face an exceptionally hard 
task. Such hard task could be easier if there are 
strong diagnostic and prognostic factors, which 
guide the choices of treatment options®.Molecular 
technique, such non coding RNA (ncRNA) 
expression profile, has been used to improve BC 
diagnosis and to evaluate patient outcome and 
response to treatment™. 


Recent evidence suggests that small 
populations of cancer stem cells can modify and 
influence neoplastic cells aggressiveness and 
behavior as well as response to therapy®.Many 
observations prove that BC ability to divide, 
progress and spread is based on this small 
subpopulation of cells with characteristics similar 
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to stem cells, known as breast cancer stem cells 
(BCSCs)®, Several markers have been reported for 
identification of BCSCs in many cancers, such as 
CD44, CD133, CD24, EpCAM, CD47and 
ALDH1”. 


Stem cells undergo two types of cell division 
(symmetric and asymmetric), producing similar stem 
cell colony or more differentiated cells. Moreover, 
stem cells remain in a state of quiescence in the tumor 
environment, facilitating the so-called kinetic 
resistance, whereby these quiescent cells are 
unaffected by agents that can cause DNA damaging 
or radiation in comparison with the more rapidly 
proliferating cells®. Also great drug resistance has 
been described for Cancer stem cells (CSCs) through 


different mechanisms such as drug effluxing™. 


In recent years, IncRNA is emerging as an 
important player in the cancer paradigm. These non 
coding RNAs are often apparently des-regulated in 
many types of human cancers “. Accumulating 
evidences provide a mechanistic insight on how 
IncRNAs regulate important signaling pathways in 
cancer cells at the transcriptional, post-transcriptional 
and epigenetic levels"” In addition to their effects on 
cancer cell growth, cell signaling and survival, 
IncRNA can modulate CSC behavior through the 
expression of pluripotency factors. The identification 
of IncRNAs that are linked to cancer stem cells 
differentiation and self-renewal provides new 
opportunities for cancer diagnosis and therapy. 
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Among the oncogenic IncRNAs is lincRNA- 
HOTAIR (HOX antisense intergenic RNA), which is 
transcriped from the mammalian homeobox C (HOXC) 
gene locus on chromosome 12q13.13. HOTAIR is up- 
regulated in primary and metastatic breast cancer”. 


Aim: the aim of the study is to retrieve non coding 
RNA that is mechanistically linked to breast cancer 
stem cell with validation of the results in a group of 
breast cancer patients versus control groups to 
evaluate their usefulness as a potential biomarker 
in breast cancer diagnosis. 


PATIENTS AND METHODS 
Patients and samples 

Thirty BC patients participated in this study; 
diagnosis is done according to _histopathological 
techniques. BC patients were classified according to 
clinical stages into; 12 (40%) stage I, 13(43.3%) stage II 
and 5 (16.7%) stage II carcinomas using the TNM 
classification American Joint Committee on Cancer, 
2016 and graded according to American Cancer Society, 
2017" We collected venous blood samples from patients 
before any therapeutic interventions, including radical 
mastectomy or modified radical mastectomy, radiotherapy 
and chemotherapy. We analyzed the serum samples of 30 
BC, 12 benign breast lesions and 12 healthy controls 
collected at general surgery department, Ain Shams 
University Hospital from June 2016 to November 2017. 


Serum was obtained from each sample 
through centrifugation of venous blood samples 
collected from participants. All serum samples 
were kept at —80 °C for further analysis. 


Ethical and Approval Statements 


The study was approved by the Ethics 
Board of Ain Shams University, Faculty of 
Medicine, Egypt and an informed written consent 
was obtained from each participant in the study. 


Extraction of total RNA, including ncRNA 


MiRNeasy® RNA isolation mini kit 
(Qiagen®, USA) was used to extract total RNA, 
including IncRNA from sera samples according to 
manufacturer's instructions. This is followed by 
reverse transcription of the extracted total RNA 
into cDNA with a miScriptRT II Kit (Qiagen®, 
USA) following the manufacturer protocol for 
sera/tissue samples) using Hybaid thermal cycler 
(Thermo Electron Waltham, MA). 
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Real time-PCR (qPCR) quantification of RNA 
based biomarker 


LncRNA-HAGLR expression in 
participant’s serum was assessed using RT? SYBR 
Green ROX qPCR Mastermix on Step One Plus™ 
System (Applied Biosystems Inc., Foster, CA). 


GAPDH was used as an internal control. All 
the PCR primers were obtained from (Qiagen®, 
USA). Relative measurement of RNA based biomarker 
expression was figured using the 2^^®™ technique. For 
normalization of raw data we used GAPDH as a 
housekeeping gene as the invariant control for the 
samples and compared with a reference sample. 


The PCR program for Syber green based 
QPCR was as follow: at first, initial activation 
stepat 95°C for 10 min; after that 40 cycles of 
denaturation for 15 sec at 95°C; then annealing for 
30 sec at 55°C; lastly, extension for 30 sec at 72°C. 


The threshold cycle (Ct) value of each 
sample was calculated using StepOnePlus™ software 
v2.2.2 (Applied Biosystems). We used melting curve 
analysis software of Applied Biosystem to analyze 
our results. Amplification plots and Tm values were 
analyzed to affirm the specificities of the amplicons 
for Sybr Green -based PCR amplification. 


Statistics 


We used SPSS 20 to do all statistical 
analyses. Comparisons were done using Mann- 
Whitney, Krausakul Wallis, chi-square test, and one- 
way analysis of variance (ANOVA), as appropriate. 
To investigate the predictive value of chosen RNA 
based biomarker for BC, we used the receiver 
operating characteristic (ROC) curve. We assessed 
the relation between RNA based biomarker 
expression and clinic-pathological parameters. 


P value= level of significance, P> 0.05 non- 
significant, P< 0.05 significant, P< 0.01 highly significant 


RESULTS 


The present study included 54 female subjects. 
They were classified into 3 groups: 


Group1: malignant breast cancer cases: 


(n=30, of mean age 54.94 14.5 years, 
median 56 years and range from 20-81 years). 


Group 2: benign breast cases diagnosed as 
fibroadenoma: 


(n=12, of mean age 46.54 15.3 years, 
median 52.5 years and range from 20-62years). 
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Group 3: healthy normal female individuals: was a highly significant difference between the 
malignant and normal control groups (p <0.01) and 


(U=33). 


But there was no significant difference 
between the benign and normal control groups (P= 
0.544) using Mann-Whitney test (U=61.5)(table 3). 


Table (4): LncRNA-HAGLR Expression in Sera 
Samples in Relation to Clinicopathological Factors 
of Malignant Group (No=30). 


LncRNA-HAGLR expression 
Positive(22) Negative(8) 2:(P) 


1 (12.5% 
ý 7 (87.5% P= (0.935) 
2.078 


5 (22.7% 4 (50.0% 
0.197 
P= (0.657) 


7 (17.3% 4 (50.0% 
2.240 
P= (0.326) 


9 (40.9% 4 (50.0% 
0.085 
P= (0.770) 


(n=12, of mean age 50.7+ 11.6 years, 
median 54.5 years, and range from 23-63 years). 


Table (1): The Age in Different Groups of the Study in Years. 


| Group | no. | Median | Range | Means SD | 
20-81 | 54.94 14.5 


Table (2): Quantitative RT-PCR for Measurement 
of IncRNA-HAGLR Mean Rank of Sera Samples 
in the Malignant Group Compared to Benign and 
Normal Control Groups. 





Clinicopathological factors 


Premenopause (9) 
Postmenopause (21) 
Positive (13) 
Negative (17) 


Menopause 
G0) 

Family history 
G0) 


) 
) 
) 
) 


0.4550 
0.05 - 4.14 
18.15 


BMI 
(G30) 


Overweigh (7) 

Obese (18) 

Past administration (20) 
Never (10) 

Past administration (19) 
Never (11) 


OCT 
G0) 


5 (68.2% 


36.95 
0.34 - 224.41 
37.75 


G0) 


6%) 
0) 
1%) 
o) 
9%) 
o) 
7%) 
1%) 
5%) 

2%) 


Type 
G30) 


2 ) 
(50.0%) 
00.0%) 
2050% 
6(15.0% 
5 (62.5% 
D 3 (37.5% = 
6 (75.0% 0.639 
2 (25.0% 
Mixed (6) 
Others (4) 
Stage(I) (12) 
Stage(II) (13) 
Stage(IID (5) 
Grade 1 (8) 
Grade 2 (20) 7 (87.5% 
Grade 3 (2) 0 (0.0%) 
Positive (16) 5 (62.5%) 0.368 
Negative (14) 3 (37.5%) 
Positive (17) 6 (75.0%) 
2 (25.0%) P= (0.222) 
(12.5%) 0.716 
P= (0.398) 


Negative (13) 


2 (25.0% 
0 (0.0%) 
2 (25.0% 
4 (50.0% 
2 (25.0% 
1 (12.5% 


4.91 
0.85 — 265 
40.63 
24.053 
0.000** 


Krausakul Wallis Test, p (>0.05): not 
GO) 


) 
) 
) 
) 
) 
6 (75.0%) 
) 
) 
) 
) 
) 
) 
significant, “p (<0.05): significant, **p (< 0.01): FETTET PeO eona | 
highly significant 3 Negative (23) 787.5%) 
4 (50.0%) 


Luminal A (11) 7 (31.8%) 
subtype Luminal B (8) 5 (22.7%) 3 (37.5%) 
G0) 


Stage 
G0) 


Grade 
G0) 





G0) 





| Premenopause 9) | 
| Postmenopause (21) | 
| Positive (13) 
| Negative (17) | 
|Overweigh (7) | 
| Obese (18) 
| Past administration (20) | 
|Never(10) 
| Past administration (19) | 
[Never | 
| Mixed (6) 
| Others (4) 
[Stage 2) 
| Staged) (13) | 
| Staged) (5) 
[Grade 18) 
|Grade220) 
| Grade 3) O 
| Positive d6 | 
| Negative (14) 
| Positive) 
| Negative (13) | 
| Positive D 
| Negative (23) 


4.839 


Using real-time PCR; the mean rank level 
P= (0.184) 


for IncRNA-HAGLR RNA in the malignant group 
(Mean rank was 18.15) as compared to benign 
(Mean rank was 37.75) and normal control groups 


X?: Chi Square test,p (>0.05): not significant, *p (<0.05): significant, 
**p (< 0.01): highly significant. 








(Mean rank was 40.63) with high significant 
difference between the three groups (P < 0.01) 
(Table 2). 


Table (3): Quantitative RT-PCR for LncRNA- 
HAGLR Comparison between Different Groups of 
the Study. 


Malignant and 
benign groups 


Malignant and 
normal control 
groups 


Benign and 
normal control 
groups 


Quantitative 
IncRNA- 
HAGLR by 
RT PCR 


Mean rankfor 
IncRNA- 
HAGLR 


(P) 
Mann-Whitney test, p (>0.05): not significant, *p (<0.05): significant, 
**p (< 0.01): highly significant 


33.75 11.63 13.38 
46.5 


P= (0.000)* * 


61.5 
P= (0.544) 


P= (0.000) ** 


By applying post hoc test for IncRNA- 
HAGLR, there was a highly significant difference 
between the malignant and benign groups (p <0.01) 
by using Mann-Whitney test (U=46.5). Also there 


i g R Normal 5 Normal 
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The positivity rate of IncRNA-HAGLR in 
sera samples was estimated among the malignant 
group of the study. 


The positivity rate of IncRNA-HAGLR was 
apparently found to be higher in multipara constituting 
(86.4%) compared to (13.6%) for nullipara. 
Postmenopausal women constituted (77.3%) in relation 
to (22.7%) for premenopausal women. 


Studying positivity rate and family history 
showed that (59.1%) of those expressing IncRNA- 
HAGLR had negative family history compared to 
(40.9%) for those with positive family history. In 
malignant cases percent of normal BMI was 
(22.7%) compared to (22.7%) for overweight and 
(54.5%) for obese women. 


Results also shows that (68.2 %) and (59.1 
%) were using OCT and HT respectively in the 
past, while (31.8%) and (40.9 %) had never used 
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OCT and HT respectively. (63.6%) of malignant 
group were IDC compared to (18.2%) for mixed 
IDC and ILC while other types constitute (18.2%) 
of IncRNA-HAGLR positive cases. 


It was found that (45.5%) of patients were 
at stage (I), (40.9%) were stage II and (13.6%) 
were stage (III),While tumors with grade 3 
constituted (9.1%) compared to (31.8%) for grade1 
and (59.1%) for grade 2. 


The results also shows that IncRNA- 
HAGLR positive cases with negative ER & PR 
represented (50%) compared to (50%) for those 
with positive ER & PR. The study also shows that 
samples with negative Her2/neu represented 
(72.7%) of samples compared to (27.3%) for those 
with positive Her2/neu. 


The correlation between expression of 
IncRNA-HAGLR and different molecular subtypes 
of breast cancer was (31.8%), (22.7%), (40.9%) 
and (4.5%) with Luminal A, Luminal B, Basal and 
Her2/neu expression Subtype respectively. 


No significant correlation was found 
between IncRNA-HAGLR expression and any of 
the investigated  clinic-pathological factors 
(P>0.05) (Table 4). 


DISCUSSION 


Breast cancer (BC) is the most common 
cancer among women worldwide", with an 
estimated 2.4 million new cancer cases diagnosed 
in 2015". In United States, an estimated 252,710 
new cases of invasive breast cancer and 63,410 
new cases of in situ breast carcinoma are diagnosed 
among women in 2017 ®. In Egypt, breast cancer 
accounts for 32 % of total malignancies among 
females “®. It also occupies the second rank of 
total cancer cases (18.3%), 


Discovering new biomarkers has been the 
subject of intense research especially with 
emergence of novel technologies. Major advances 
in cancer control will be greatly used in early 
cancer detection “®. With the need for finding 
biomarkers for early detection of cancer to improve 
prognosis and survival “®”. 


Thus the task of bioinformatics in the cancer 
biomarker discovery is to provide prioritized lists of 
marker candidates aided by availability of 
microarrays and bisulfite sequencing and it is 
anticipated that those key biomarkers identified will 
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both represent early disease and lead to improved 
understanding of tumorgenesis™. 


Breast CSCs appear to be resistant to hypoxia, 
chemotherapy and radiotherapy. Moreover, breast CSCs 
show high tumorigenicity and invasiveness, which are 
important to the occurrence, development, metastasis and 
breast cancer recurrence ”. 


Transcription factors such as OCT4, SOX2, 
NANOG, c-MYC and KLF4, and signaling pathways 
including Hedgehog, WNT, Notch, PDGF, TGF-B 
and JAK/STAT, play important roles in sustaining 
self-renewal capacity in CSCs and therefore provide 
potential targets in the emergence of therapeutic 
strategies”. 


In recent years, numerous studies revealed 
that different IncRNAs could regulate CSCs in many 
types of cancer via various molecular mechanisms, 
which include proliferation, differentiation and self- 
renewal, promotion of metastasis, invasion and 
expecting prognosis and targeted therapies. Up to 
date, HOTAIR, H19, ARSR and UCA1 are the most 
prominent IncRNAs in CSCs. the dysregulation of 
IncRNAs is a potential biomarker in diagnosis, 
prognosis, and target therapy of cancers”. 


The IncRNAsare group of non-coding 
RNAs that regulate gene expression transcriptionally 
and post-transcriptionally The IncRNAs are longer 
than 200 nucleotides and involved in the pathology of 
many diseases including cancers °°. They became a 
subject of interest because of thier role in 
dysregulation in multiple types of human cancers and 
also they can act as prognostic markers and can be 
therapeutic targets ?®. 


HAGLR RNA contains eight exons and its 
transcript is a novel IncRNA. It is transcribed from 
the HOXD cluster on human chromosome 2q31.2 in 
the anti-sense manner. HOXD gene is a member of 
the HOX cluster which regulates organogenesis and 
embryogenesis. HOX gene dysregulation occurs in 
multiple types of cancers””, The HOX genes are the 
key developmental regulators in many processes, 
involving apoptosis, differentiation and receptor 
signaling. Dysregulation of HOX genes is frequently 
implicated in malignancy and plays essential roles in 
oncogenesis and/or tumor suppression. LncNA- 
HAGLR is included in regulating the JAK2/STAT3 
signaling pathway 7*). 


The results of our study revealed that IncRNA- 
HAGLR expression was down regulated in breast 
cancer patients (Mean rank was 18.15) when compared 
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to benign breast lesions (Mean rank was 37.75) and 
normal healthy control individuals (Mean rank was 
40.63) where there was a highly significant difference 
among the three study groups as regards fold change 
(RQ) of serum LncRNA-HAGLR expression (P< 0.01). 


There was a highly significant difference 
between the malignant group and benign group (p 
<0.01) by using Mann-Whitney test (U=46.5), also 
there was a highly significant difference between 
the malignant group and normal control group 
(U=33) and (p <0.01). 


Interestingly, the ROC curve for the IncRNA- 
HAGLR gene expression in results of our study 
revealed a cut off value of < 1.045 to discriminate 
malignant from non malignant cases with a high 
statistical significance (p<0.001) where an expression 
level below the cutoff value is considered to be positive 
because IncRNA-HAGLR_ expression is down 
regulated. Using this cutoff value, 22 out of 30 
malignant patients were positive (< cutoff value 1.045) 
for IncRNA-HAGLR expression (73.3%), 1 out of 12 
benign patients were positive (8.3%) and 1 out of 12 
normal individuals were positive forlncRNA-HAGLR 
expression (8.3%) at (P<0.01). 


Thus LncRNA-HAGLR expression could be 
used as a sensitive biomarker for early diagnosis of BC 
with recorded sensitivity73.3%, specificity 91.6%, 
PPV 91.6%, NPV 73.3% and accuracy 81.4%. 


CONCLUSION 


IncRNA-HAGLR detected by quantitative 
RT-PCR could be used as promising biomarkers for 
breast cancer. 
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